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SEQUENCE LISTING 

<110> Ding, Shi -You 

Adney, William S. 
Vinzant, Todd B. 
Himmel , Michael E 
Decker, Stephen R 

<120> THERMAL TOLERANT CELLULASE FROM ACIDOTHERMUS CELLULOLYTICUS 

<130> NREL 01-37 

<140> US 09/917,383 
<141> 2001-07-28 

<160> 14 

<170> Patentln version 3.2 

<210> 1 

<211> 1228 

<212> PRT 

<213> Acidothermus cellulolyticus 
<220> 

<221> misc_feature 

<223> Guxa full-lengh protein sequence 

<400> 1 

Met Glu Arg Thr Gin Gin ser Gly Arg Asn Cys Arg Tyr Gin Arg Gly 
1 5 10 15 

*. 

Thr Thr Arg Met Pro Ala lie Ser Lys Arg Leu Arg Ala Gly Val Leu 
20 25 30 

Ala Gly Ala Val Ser lie Ala Ala Ser lie Val Pro Leu Ala Met Gin 
35 40 45 

His Pro Ala lie Ala Ala Thr His val Asp Asn Pro Tyr Ala Gly Ala 
50 55 60 

Thr Phe Phe Val Asn Pro Tyr Trp Ala Gin Glu val Gin Ser Glu Ala 
65 70 75 80 

Ala Asn Gin Thr Asn Ala Thr Leu Ala Ala Lys Met Arg val val Ser 
85 90 95 

Thr Tyr Ser Thr Ala val Trp Met Asp Arg lie Ala Ala lie Asn Gly 
100 105 110 

Val Asn Gly Gly Pro Gly Leu Thr Thr Tyr Leu Asp Ala Ala Leu Ser 
115 120 125 

Gin Gin Gin Gly Thr Thr Pro Glu Val lie Glu lie val lie Tyr Asp 
130 135 140 



Leu Pro Gly Arg Asp Cys Ala Ala Leu Ala Ser Asn Gly Glu Leu Pro 
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145 



150 



Sequence listing.txt 
155 



160 



Ala Thr Ala Ala Gly Leu Gin Thr Tyr Glu Thr Gin Tyr lie Asp Pro 
165 170 175 

lie Ala Ser lie Leu ser Asn Pro Lys Tyr Ser ser Leu Arg lie val 
180 185 190 

Thr lie lie Glu Pro Asp Ser Leu Pro Asn Ala Val Thr Asn Met Ser 
195 200 205 

lie Gin Ala Cys Ala Thr Ala val Pro Tyr Tyr Glu Gin Gly lie Glu 
210 215 220 

Tyr Ala Leu Thr Lys Leu His Ala lie Pro Asn val Tyr lie Tyr Met 
225 230 235 240 

Asp Ala Ala His Ser Gly Trp Leu Gly Trp Pro Asn Asn Ala Ser Gly 
245 250 255 

Tyr Val Gin Glu val Gin Lys val Leu Asn Ala Ser lie Gly val Asn 
260 265 270 

Gly lie Asp Gly Phe val Thr Asn Thr Ala Asn Tyr Thr Pro Leu Lys 
275 280 285 

Glu Pro Phe Met Thr Ala Thr Gin Gin val Gly Gly Gin Pro val Glu 
290 295 300 

Ser Ala Asn Phe Tyr Gin Trp Asn Pro Asp lie Asp Glu Ala Asp Tyr 
305 310 315 320 

Ala val Asp Leu Tyr Ser Arg Leu val Ala Ala Gly Phe Pro Ser Ser 
325 330 335 

lie Gly Met Leu lie Asp Thr Leu Arg Asn Gly Trp Gly Gly Pro Asn 
340 345 350 

Glu Pro Thr Gly Pro Ser Thr Ala Thr Asp Val Asn Thr Phe Val Asn 
355 360 365 

Gin Ser Lys lie Asp Leu Arg Gin His Arg Gly Leu Trp Cys Asn Gin 
370 375 380 

Asn Gly Ala Gly Leu Gly Gin Pro Pro Gin Ala Ser Pro Thr Asp Phe 
385 390 395 400 

Pro Asn Ala His Leu Asp Ala Tyr Val Trp lie Lys Pro Pro Gly Glu 
405 410 415 



Ser Asp Gly Thr Ser Ala Ala Ser Asp Pro Thr Thr Gly Lys Lys Ser 
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Sequence listing.txt 
425 430 



Asp Pro Met Cys Asp Pro Thr Tyr Thr Thr Ser Tyr Gly Val Leu Thr 
435 440 445 

Asn Ala Leu Pro Asn Ser Pro lie Ala Gly Gin Trp Phe Pro Ala Gin 
450 455 460 

Phe Asp Gin Leu val Ala Asn Ala Arg Pro Ala val Pro Thr Ser Thr 
465 470 475 480 

Ser Ser Ser Pro Pro Pro Pro Pro Pro Ser Pro ser Ala Ser Pro Ser 
485 490 495 



pro Ser Pro Ser pro Ser Pro Ser Ser Ser Pro Ser Pro Ser Pro Ser 
500 505 510 



Pro Ser Ser Ser Pro Ser Pro Ser Pro Ser Pro Ser Pro Ser Pro Ser 
515 520 525 



Ser Ser Pro Ser Pro Ser Pro Ser Ser Ser Pro Ser Pro Ser Pro Ser 
530 535 540 

Pro Ser Pro Ser Pro Ser Ser Ser Pro Ser Pro Ser Pro Ser Ser Ser 
545 550 555 560 

Pro Ser Pro Ser Pro ser Pro Ser Pro Ser Pro Ser Ser Ser Pro Ser 
565 570 575 

Pro Ser Pro Thr Ser Ser Pro val Ser Gly Gly Leu Lys Val Gin Tyr 
580 585 590 

Lys Asn Asn Asp Ser Ala Pro Gly Asp Asn Gin lie Lys Pro Gly Leu 
595 600 605 

Gin Leu Val Asn Thr Gly Ser Ser Ser val Asp Leu Ser Thr val Thr 
610 615 620 

Val Arg Tyr Trp Phe Thr Arg Asp Gly Gly Ser Ser Thr Leu val Tyr 
625 630 635 640 

Asn cys Asp Trp Ala Ala Met Gly Cys Gly Asn lie Arg Ala Ser Phe 
645 650 655 

Gly Ser val Asn Pro Ala Thr Pro Thr Ala Asp Thr Tyr Leu Gin Leu 
660 665 670 



Ser Phe Thr Gly Gly Thr Leu Ala Ala Gly Gly Ser Thr Gly Glu lie 
675 680 685 



Gin Asn Arg Val Asn Lys Ser Asp Trp ser Asn Phe Thr Glu Thr Asn 
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695 700 



Asp Tyr Ser Tyr Gly Thr Asn Thr Thr Phe Gin Asp Trp Thr Lys val 

705 710 715 720 

Thr val Tyr Val Asn Gly Val Leu Val Trp Gly Thr Glu Pro Ser Gly 

725 730 735 



Thr Ser Pro Ser Pro Thr Pro Ser Pro Ser Pro Ser Pro Ser Pro Ser 
740 745 750 

Pro Gly Gly Asp Val Thr Pro Pro Ser Val Pro Thr Gly Leu Val Val 
755 760 765 

Thr Gly val Ser Gly Ser Ser val Ser Leu Ala Trp Asn Ala Ser Thr 
770 775 780 

Asp Asn val Gly Val Ala His Tyr Asn Val Tyr Arg Asn Gly Val Leu 
785 790 795 800 

val Gly Gin Pro Thr Val Thr Ser Phe Thr Asp Thr Gly Leu Ala Ala 
805 810 815 

Gly Thr Ala Tyr Thr Tyr Thr val Ala Ala val Asp Ala Ala Gly Asn 
820 825 830 

Thr Ser Ala Pro Ser Thr Pro val Thr Ala Thr Thr Thr Ser Pro ser 
835 840 845 

Pro Ser Pro Thr Pro Thr Gly Thr Thr Val Thr Asp Cys Thr Pro Gly 
850 855 860 

Pro Asn Gin Asn Gly Val Thr Ser Val Gin Gly Asp Glu Tyr Arg Val 
865 870 875 880 

Gin Thr Asn Glu Trp Asn Ser Ser Ala Gin Gin Cys Leu Thr lie Asn 
885 890 895 

Thr Ala Thr Gly Ala Trp Thr val Ser Thr Ala Asn Phe Ser Gly Gly 
900 905 910 

Thr Gly Gly Ala Pro Ala Thr Tyr Pro Ser lie Tyr Lys Gly Cys His 
915 920 925 

Trp Gly Asn Cys Thr Thr Lys Asn Val Gly Met Pro lie Gin lie Ser 
930 935 940 

Gin lie Gly Ser Ala val Thr Ser Trp ser Thr Thr Gin val Ser Ser 
945 950 955 960 



Gly Ala Tyr Asp val Ala Tyr Asp lie Trp Thr Asn ser Thr Pro Thr 
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970 



975 



Thr Thr Gly Gin Pro Asn Gly Thr Glu lie Met lie Trp Leu Asn Ser 
980 985 990 

Arg Gly Gly val Gin Pro Phe Gly Ser Gin Thr Ala Thr Gly Val Thr 
995 1000 1005 

val Ala Gly His Thr Trp Asn Val Trp Gin Gly Gin Gin Thr Ser 
1010 1015 1020 

Trp Lys lie lie Ser Tyr Val Leu Thr Pro Gly Ala Thr Ser lie 
1025 1030 1035 

Ser Asn Leu Asp Leu Lys Ala lie Phe Ala Asp Ala Ala Ala Arg 
1040 1045 1050 

Gly Ser Leu Asn Thr Ser Asp Tyr Leu Leu Asp Val Glu Ala Gly 
1055 1060 1065 

Phe Glu lie Trp Gin Gly Gly Gin Gly Leu Gly Ser Asn Ser Phe 
1070 1075 1080 

Ser Val Ser val Thr ser Gly Thr Ser Ser Pro Thr Pro Ser Pro 
1085 1090 1095 

Ser Pro Thr Pro Thr Pro Ser Pro Thr Pro Thr Pro Ser Pro Ser 
1100 1105 1110 

Pro Thr Pro Ser Pro Ser Pro Thr Ser Ser Pro Ser Ser Ser Gly 
1115 1120 1125 

val Ala Cys Arg Ala Thr Tyr Val Val Asn Ser Asp, Trp Gly Ser 
1130 1135 1140 

Gly Phe Thr Ala Thr val Thr val Thr Asn Thr Gly Ser Arg Ala 
1145 1150 1155 

Thr Asn Gly Trp Thr Val Ala Trp Ser Phe Gly Gly Asn Gin Thr 
1160 1165 1170 

val Thr Asn Tyr Trp Asn Thr Ala Leu Thr Gin Ser Gly Ala Ser 
1175 1180 1185 

val Thr Ala Thr Asn Leu Ser Tyr Asn Asn Val lie Gin Pro Gly 
1190 1195 1200 

Gin Ser Thr Thr Phe Gly Phe Asn Gly Ser Tyr Ser Gly Thr Asn 
1205 1210 1215 

Ala Ala Pro Thr Leu Ser Cys Thr Ala Ser 
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1225 



<210> 2 

<211> 3687 

<212> DNA 

<213> Acidothermus cellulolyticus 
<220> 

<221> mi sc_feature 

<223> GuxA full-length coding sequence 

<400> 2 



atggagcgaa 


cccaacaatc 


cggacggaac 


tgcaggtacc 


agagaggaac 


gacacgaatg 


60 


cccgccatct 


caaaacggct 


gcgagccggc 


gtcctcgccg 


gggcggtgag 


catcgcagcc 


120 


tccatcgtgc 


cgctggcgat 


gcagcatcct 


gccatcgccg 


cgacgcacgt 


cgacaatccc 


180 


tatgcgggag 


cgaccttctt 


cgtcaacccg 


tactgggcgc 


aagaagtaca 


gagcgaagcg 


240 


gcgaaccaga 


ccaatgccac 


tctcgcagcg 


aaaatgcgcg 


tcgtttccac 


atattcgacg 


300 


gccgtctgga 


tggaccgcat 


cgctgcgatc 


aacggcgtca 


acggcggacc 


cggcttgacg 


360 


acatatctgg 


acgccgccct 


ctcccagcag 


cagggaacca 


cccctgaagt 


cattgagatt 


420 


gtcatctacg 


atctgccggg 


acgcgactgc 


gcggcgctcg 


cctccaacgg 


cgaactgccc 


480 


gctacggcag 


caggtttgca 


gacctatgaa 


acgcagtaca 


tcgatccgat 


tgcgagtatc 


540 


ctgagcaatc 


cgaagtactc 


cagcctgcgg 


atcgtgacga 


tcattgagcc 


ggactcgctg 


600 


ccaaacgcgg 


tcaccaatat 


gagcattcaa 


gcgtgtgcaa 


cggcggtgcc 


gtattacgag 


660 


caaggcatcg 


agtacgcgct 


cacgaaattg 


cacgccattc 


cgaacgtgta 


catctacatg 


720 


gacgccgccc 


actccggctg 


gcttgggtgg 


cccaataatg 


ccagcggata 


cgtacaggaa 


780 


gtccagaagg 


tcctcaacgc 


gagcatcggg 


gtcaacggca 


tcgacggctt 


cgtcaccaac 


840 


acggcgaatt 


acacgccgtt 


gaaggagccg 


ttcatgaccg 


ccacccagca 


ggtcggcgga 


900 


cagccggtgg 


agtcggcgaa 


tttctaccag 


tggaatcctg 


acatcgacga 


agccgactac 


960 


gcggttgact 


tgtactcgcg 


gctcgtcgcc 


gctggctttc 


caagcagcat 


cggcatgctc 


1020 


atcgacacct 


tacgcaacgg 


ttggggtggt 


ccgaacgaac 


caacaggccc 


gagcaccgcg 


1080 


accgatgtca 


acaccttcgt 


caaccagtcg 


aagattgacc 


ttcggcagca 


ccgcggcctg 


1140 


tggtgcaacc 


agaacggtgc 


gggcctcggc 


cagccgccgc 


aggcaagccc 


gacggacttc 


1200 


ccgaacgcgc 


acctcgacgc 


gtatgtctgg 


atcaagccgc 


cgggtgagtc 


ggacggcaca 


1260 


agcgctgcga 


gcgatccgac 


aactggcaag 


aagtcggacc 


ccatgtgcga 


cccgacgtac 


1320 


acgacgtcgt 


acggggtact 


gaccaacgcg 


ttaccgaact 


ccccgatcgc 


cggccagtgg 


1380 


ttcccggcgc 


agtttgacca 


gcttgtcgcg 


aacgcacggc 


cagcggtgcc 


gacgtcgacc 


1440 


agctcgagcc 


cgccgcctcc 


gccgccgagt 


ccgtcggctt 


cgccgagtcc 


gagcccgagt 


1500 


ccgagcccga 


gcagctcgcc 


atcgccgtcg 


ccgtctccga 


gctcgagccc 


gtctccgtcg 


1560 


ccgagcccga 


gtccgagccc 


gagtagctcg 


ccgtcgccgt 


ctccgagctc 


gagcccgtct 


1620 
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CCy LCgCCCJd 


gcccgagtcc 


sequence listing.txt 
gageccgagt agctcgccgt cgccgtctcc 


y ay l. l l. y a y l. 


1680 


LLg ll LCLg l 


LgcLydgcLL 


gagtccgagc 


ccgagtagct 


cgccgtcgcc 


y V- v_ l v_ v.. y a l. y 


1740 


ucg xcgccgg 


Lcgggugg 


gctgaaggtg 


cagtacaaga 


acaatgattc 


yy^-y^yyy L 


1800 


/-« "3 -3 *3 f /~ "a /~i *a 

yd.tad.LCdy d 


LLdddLLyy y 


tctccagttg 


gtgaataccg 


ggtcgtcgtc 


yy i-yyc*- *- *-y 


1860 


i- /ti "3 /~ n n "t" n "3 

1-L.y dcy y ty d 


LyyuyLyyid 


ctggttcacc 


cgggatggtg 


ggtegtcgae 


av. *~yy Ly lql 




aaL Ly LydL L 


gggcggcg*^ 


ggggtgtggg 


aatatccgcg 


cctcgttcgg 


v_ Lv_yy L^aaL 


1980 

JU^/L> \J 


LLyyLya.L.yi» 


LydLyyLyyd 


cacctacctg 


cagttgtcgt 


tcactggtgg 


a a V— y L 


2040 


nrtnntnnnt 

gcxggxgggx: 


LydLyyy tyd 


gattcaaaac 


egggtgaata 


agagtgactg 


y l l. y a a l l. lv. 


2100 


aLLyayaLLa 


dLydLLdLLL 


gtatgggacg 


aacaccacct 


tccaggactg 


ya\_yaayy Ly 


2160 


dcyy ty LdLy 


LLddLyyLy l 


gttggtgtgg 


gggactgaac 


cgtccggcac 


L,ayL.L-L.V_.ayL. 


2220 


CLLdLdLLaL 


LLLLydy LLL 


gagcccgagc 


ccgagcccgg 


gtggggatgt 


fip. c nr r nr r n 
yaL.yL.L,yL.v,y 


2280 


ag^g LyLLyd 


LLyyLL Lyy l 


ggtgacgggg 


gtgagtgggt 


cgtcggtgtc 


nt"l"finrnl"nn 

y L L yy*-y L yy 


2340 


aa tgcg LCga 


Ly yd LddLy L 


gggggtggcg 


cattacaacg 


tgtaccgcaa 


^-yyyy L 9 L L y 


2400 


gLgggccagc 


LydLy y LydL 


ctcgttcacc 


gaeaegggtt 


tggccgcggg 


aaL.v_y L.y l a v_ 


2460 


dLL LaCdCyg 


LyyLLyLyyt 


ggacgctgcg 


ggtaacacct 


ccgccccatc 


r 7\ r r r r f*n1~ c 


2S20 


^ r r~ r\ r r ^ f f 

aCCgCCaCCa 


LLdLydy LLL 


cagccccagc 


cccacgccga 


cggggaccac 


yy LLaLLyaL 


2SR0 


tgcacgcccg 


gLLL LddLLd 


gaatggtgtg 


accagcgtgc 


agggegaega 


d LdLLyyy Ly 


2640 


CdgdCCaa Lg 


d.g tggdd L LL 


gtcggcccag 


cagtgcctca 


ccatcaatac 


LyLya.v_L.yy l 


2700 


gcc Lggcicgg 


Lydy LdL Ly L 


gaacttcagc 


ggtgggaccg 


gcggtgcgcc 


i_y i_y ai»y l a l 


2760 


ccg Lcgci xc u 


-3 /- -3 annnrtn 

dLddy yy LLy 


ccactggggc 


aactgcacca 


egaagaaegt 


r nnnpt nr r n 
Lyyya LyLLy 


2820 


d L LLdgd LLd 


9 LLdydL ty y 


ttcggctgtg 


acgtcgtgga 


gtacgacgea 


nn+ fit - c 'fit" c n 
yy *~y *-*-y *-y 


2880 


99LgcgLdLg 


dLy LyyLL id 


cgacatttgg 


acgaacagta 


ccccaacgac 


a a l. L.y y l l. ay 


2Q40 


LLdddLyy Ld 


LLyddd LLd L 


gatttggctg 


aattcgcgtg 


gtggggtgca 


y L.L,y l LL.yyy 


^000 


LLyLdydLdy 


LydLyyy Ly l 


gaeggtcget 


ggtcacacgt 


ggaatgtctg 


y Layy y LLay 


3060 


LdydLL LLy L 


yy ddy d l id l 


ttcctacgtc 


ctgacccccg 


gtgcgacgtc 


ya.LL.ay l aa l 


31 20 


LtyydLLLyd 


dy y Ly d llll 


cgcggacgcc 


gcggcacgcg 


ggtcgctcaa 


LuLL LL.L,ya L 


3180 

J XL* V/ 


Lace xgc LLy 


dLy l Lydy yL 


egggtttgag 


atctggcaag 


gtggtcaggg 


rrtnnnrsnr 
ll LyyyLdyL 


3240 


aaC LCgt iCa 


gcg ll LLLgL 


gaegagegge 


acgtccagcc 


cgacaccgag 


LLLydyLLLy 


3300 


dcgccydCdc 


Lydy L LLy dL 


gccgacaccg 


tctccgagcc 


cgaccccgtc 


yLLyay LLLy 


3360 


accagc xcgc 


eg Lcgxcg LL 


gggtgtggcg 


tgeegggega 


cgtatgtggt 


y dd Ldy LydL 


3470 


tggggttctg 


ggtttacggc 


gacggtgacg 


gtgacgaata 


ccgggagccg 


ggegacgaac 


3480 


gggtggacgg 


tggcgtggtc 


gtttggtggg 


aatcagaegg 


tcacgaacta 


ctggaacact 


3540 


gcgttgaccc 


aatcaggtgc 


atcggtgacg 


gcgacgaacc 


tgagttacaa 


caaegtgate 


3600 


caaccgggtc 


agtcgaccac 


etteggatte 


aacggaagtt 


actcaggaac 


aaacgccgcg 


3660 
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ccgacgctca gctgcacagc cagctga 



3687 



<210> 3 

<211> 53 

<212> PRT 

<213> Acidothermus cellulolyticus 
<220> 

<221> mi sc_f eature 

<223> potential signal peptide 

<400> 3 

Met Glu Arg Thr Gin Gin Ser Gly Arg Asn Cys Arg Tyr Gin Arg Gly 
15 10 15 

Thr Thr Arg Met Pro Ala lie Ser Lys Arg Leu Arg Ala Gly val Leu 
20 25 30 

Ala Gly Ala Val Ser lie Ala Ala Ser lie Val Pro Leu Ala Met Gin 
35 40 45 

His Pro Ala lie Ala 
50 



<210> 4 

<211> 423 

<212> PRT 

<213> Acidothermus cellulolyticus 
<220> 

<221> mi sc_f eature 

<223> sequence of GH6 domain 

<400> 4 

Ala Thr His Val Asp Asn Pro Tyr Ala Gly Ala Thr Phe Phe val Asn 
15 10 15 

Pro Tyr Trp Ala Gin Glu val Gin ser Glu Ala Ala Asn Gin Thr Asn 
20 25 30 

Ala Thr Leu Ala Ala Lys Met Arg val val Ser Thr Tyr Ser Thr Ala 
35 40 45 

val Trp Met Asp Arg lie Ala Ala lie Asn Gly val Asn Gly Gly Pro 
50 55 60 

Gly Leu Thr Thr Tyr Leu Asp Ala Ala Leu Ser Gin Gin Gin Gly Thr 
65 70 75 80 

Thr Pro Glu Val lie Glu lie val lie Tyr Asp Leu Pro Gly Arg Asp 
85 90 95 



Cys Ala Ala Leu Ala Ser Asn Gly Glu Leu Pro Ala Thr Ala Ala Gly 
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105 110 



Leu Gin Thr Tyr Glu Thr Gin Tyr lie Asp Pro lie Ala Ser lie Leu 
115 120 125 

Ser Asn Pro Lys Tyr Ser Ser Leu Arg lie val Thr lie lie Glu Pro 
130 135 140 

Asp Ser Leu Pro Asn Ala Val Thr Asn Met Ser lie Gin Ala Cys Ala 
145 150 155 160 

Thr Ala val Pro Tyr Tyr Glu Gin Gly lie Glu Tyr Ala Leu Thr Lys 
165 170 175 

Leu His Ala lie Pro Asn val Tyr lie Tyr Met Asp Ala Ala His Ser 
180 185 190 

Gly Trp Leu Gly Trp Pro Asn Asn Ala Ser Gly Tyr val Gin Glu val 
195 200 205 

Gin Lys Val Leu Asn Ala Ser lie Gly Val Asn Gly lie Asp Gly Phe 
210 215 220 

val Thr Asn Thr Ala Asn Tyr Thr Pro Leu Lys Glu Pro Phe Met Thr 
225 230 235 240 

Ala Thr Gin Gin val Gly Gly Gin Pro val Glu Ser Ala Asn Phe Tyr 
245 250 255 

Gin Trp Asn Pro Asp lie Asp Glu Ala Asp Tyr Ala val Asp Leu Tyr 
260 265 270 

Ser Arg Leu Val Ala Ala Gly Phe Pro Ser Ser lie Gly Met Leu lie 
275 280 285 

Asp Thr Leu Arg Asn Gly Trp Gly Gly Pro Asn Glu Pro Thr Gly Pro 
290 295 300 

Ser Thr Ala Thr Asp val Asn Thr Phe Val Asn Gin ser Lys lie Asp 
305 310 315 320 

Leu Arg Gin His Arg Gly Leu Trp Cys Asn Gin Asn Gly Ala Gly Leu 
325 330 335 

Gly Gin Pro Pro Gin Ala Ser Pro Thr Asp Phe Pro Asn Ala His Leu 
340 345 350 

Asp Ala Tyr val Trp lie Lys Pro Pro Gly Glu Ser Asp Gly Thr Ser 
355 360 365 



Ala Ala Ser Asp Pro Thr Thr Gly Lys Lys ser Asp Pro Met Cys Asp 
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Sequence listing.txt 
370 375 380 

Pro Thr Tyr Thr Thr Ser Tyr Gly val Leu Thr Asn Ala Leu Pro Asn 
385 390 395 400 

Ser Pro lie Ala Gly Gin Trp Phe Pro Ala Gin Phe Asp Gin Leu val 
405 410 415 

Ala Asn Ala Arg Pro Ala Val 
420 



<210> 5 

<211> 150 

<212> PRT 

<213> Acidothermus cellulolyticus 
<220> 

<221> misc_feature 

<223> Sequence of CBD III domain 

<400> 5 

val Ser Gly Gly Leu Lys val Gin Tyr Lys Asn Asn Asp Ser Ala Pro 
15 10 15 

Gly Asp Asn Gin lie Lys Pro Gly Leu Gin Leu Val Asn Thr Gly Ser 
20 25 30 

Ser Ser val Asp Leu Ser Thr val Thr val Arg Tyr Trp Phe Thr Arg 
35 40 45 

Asp Gly Gly Ser Ser Thr Leu Val Tyr Asn Cys Asp Trp Ala Ala Met 
50 55 60 

Gly Cys Gly Asn lie Arg Ala Ser Phe Gly Ser val Asn Pro Ala Thr 
65 70 75 80 

Pro Thr Ala Asp Thr Tyr Leu Gin Leu Ser Phe Thr Gly Gly Thr Leu 
85 90 95 

Ala Ala Gly Gly Ser Thr Gly Glu lie Gin Asn Arg Val Asn Lys Ser 
100 105 110 

Asp Trp Ser Asn Phe Thr Glu Thr Asn Asp Tyr Ser Tyr Gly Thr Asn 
115 120 125 

Thr Thr Phe Gin Asp Trp Thr Lys val Thr val Tyr val Asn Gly Val 
130 135 140 

Leu val Trp Gly Thr Glu 
145 150 



<210> 6 
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<211> 85 

<212> PRT 

<213> Acidothermus cellulolyticus 
<220> 

<221> misc_feature 

<223> FN ill domain of GuxA 

<400> 6 

Asp val Thr Pro Pro Ser val Pro Thr Gly Leu val Val Thr Gly Val 
15 10 15 

Ser Gly ser Ser val Ser Leu Ala Trp Asn Ala Ser Thr Asp Asn Val 
20 25 30 

Gly Val Ala His Tyr Asn val Tyr Arg Asn Gly Val Leu val Gly Gin 
35 40 45 

Pro Thr val Thr Ser Phe Thr Asp Thr Gly Leu Ala Ala Gly Thr Ala 
50 55 60 

Tyr Thr Tyr Thr val Ala Ala Val Asp Ala Ala Gly Asn Thr Ser Ala 
65 70 75 80 

Pro Ser Thr Pro val 
85 



<210> 7 

<211> 231 

<212> PRT 

<213> Acidothermus cellulolyticus 
<220> 

<221> mi sc_f eature 

<223> GH12 domain of GuxA 

<400> 7 

Asp Cys Thr Pro Gly Pro Asn Gin Asn Gly val Thr Ser val Gin Gly 
15 10 15 

Asp Glu Tyr Arg Val Gin Thr Asn Glu Trp Asn Ser Ser Ala Gin Gin 
20 25 30 

Cys Leu Thr lie Asn Thr Ala Thr Gly Ala Trp Thr val Ser Thr Ala 
35 40 45 

Asn Phe Ser Gly Gly Thr Gly Gly Ala Pro Ala Thr Tyr Pro Ser lie 
50 55 60 

Tyr Lys Gly Cys His Trp Gly Asn Cys Thr Thr Lys Asn val Gly Met 
65 70 75 80 

Pro lie Gin lie Ser Gin lie Gly Ser Ala val Thr Ser Trp Ser Thr 
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85 90 95 

Thr Gin val Ser Ser Gly Ala Tyr Asp val Ala Tyr Asp lie Trp Thr 
100 105 110 

Asn Ser Thr Pro Thr Thr Thr Gly Gin Pro Asn Gly Thr Glu lie Met 
115 120 125 

lie Trp Leu Asn Ser Arg Gly Gly val Gin Pro Phe Gly Ser Gin Thr 
130 135 140 

Ala Thr Gly val Thr val Ala Gly His Thr Trp Asn val Trp Gin Gly 
145 150 155 160 

Gin Gin Thr Ser Trp Lys lie lie ser Tyr val Leu Thr Pro Gly Ala 
165 170 175 

Thr Ser lie Ser Asn Leu Asp Leu Lys Ala lie Phe Ala Asp Ala Ala 
180 185 190 

Ala Arg Gly Ser Leu Asn Thr Ser Asp Tyr Leu Leu Asp Val Glu Ala 
195 200 205 

Gly Phe Glu lie Trp Gin Gly Gly Gin Gly Leu Gly Ser Asn Ser Phe 
210 215 220 

Ser Val Ser Val Thr Ser Gly 
225 230 



<210> 8 

<211> 101 

<212> PRT 

<213> Acidothermus cellulolyticus 
<220> 

<221> mi sc_f eature 

<223> Sequence of CBD II domain 

<400> 8 

Gly val Ala Cys Arg Ala Thr Tyr val val Asn Ser Asp Trp Gly Ser 
15 10 15 

Gly Phe Thr Ala Thr Val Thr val Thr Asn Thr Gly Ser Arg Ala Thr 
20 25 30 

Asn Gly Trp Thr Val Ala Trp Ser Phe Gly Gly Asn Gin Thr val Thr 
35 40 45 

Asn Tyr Trp Asn Thr Ala Leu Thr Gin Ser Gly Ala Ser val Thr Ala 
50 55 60 

Thr Asn Leu Ser Tyr Asn Asn Val lie Gin Pro Gly Gin Ser Thr Thr 
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70 



Sequence "listing.txt 
75 



80 



Phe Gly Phe Asn Gly Ser Tyr Ser Gly Thr Asn Ala Ala Pro Thr Leu 
85 90 95 

Ser Cys Thr Ala Ser 
100 



<210> 9 
<211> 423 
<212> PRT 

<213> Acidothermus cellulolyticus 
<220> 

<221> mi sc_feature 

<223> GH6 domain of Acidothermus cellulolyticus 
<400> 9 

Ala Thr His val Asp Asn Pro Tyr Ala Gly Ala Thr Phe Phe val Asn 
15 10 15 

Pro Tyr Trp Ala Gin Glu val Gin Ser Glu Ala Ala Asn Gin Thr Asn 
20 25 30 

Ala Thr Leu Ala Ala Lys Met Arg val val Ser Thr Tyr Ser Thr Ala 
35 40 45 

val Trp Met Asp Arg lie Ala Ala lie Asn Gly val Asn Gly Gly Pro 
50 55 60 

Gly Leu Thr Thr Tyr Leu Asp Ala Ala Leu Ser Gin Gin Gin Gly Thr 
65 70 75 80 

Thr Pro Glu Val lie Glu lie Val lie. Tyr Asp Leu Pro Gly Arg Asp 
85 90 95 

Cys Ala Ala Leu Ala Ser Asn Gly Glu Leu Pro Ala Thr Ala Ala Gly 
100 105 110 

Leu Gin Thr Tyr Glu Thr Gin Tyr lie Asp Pro lie Ala Ser lie Leu 
115 120 125 

Ser Asn Pro Lys Tyr Ser Ser Leu Arg lie val Thr lie lie Glu Pro 
130 135 140 

Asp Ser Leu Pro Asn Ala val Thr Asn Met Ser lie Gin Ala Cys Ala 
145 150 155 160 

Thr Ala val Pro Tyr Tyr Glu Gin Gly lie Glu Tyr Ala Leu Thr Lys 
165 170 175 

Leu His Ala lie Pro Asn val Tyr He Tyr Met Asp Ala Ala His Ser 
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180 185 190 

Gly Trp Leu Gly Trp Pro Asn Asn Ala Ser Gly Tyr val Gin Glu val 
195 200 205 

Gin Lys Val Leu Asn Ala Ser lie Gly val Asn Gly He Asp Gly Phe 
210 215 220 

Val Thr Asn Thr Ala Asn Tyr Thr Pro Leu Lys Glu Pro Phe Met Thr 
225 230 235 240 

Ala Thr Gin Gin Val Gly Gly Gin Pro val Glu Ser Ala Asn Phe Tyr 
245 250 255 

Gin Trp Asn Pro Asp lie Asp Glu Ala Asp Tyr Ala Val Asp Leu Tyr 
260 265 270 

Ser Arg Leu val Ala Ala Gly Phe Pro ser ser lie Gly Met Leu lie 
275 280 285 

Asp Thr Leu Arg Asn Gly Trp Gly Gly Pro Asn Glu Pro Thr Gly Pro 
290 295 300 



Ser Thr Ala Thr Asp Val Asn Thr Phe Val Asn Gin Ser Lys lie Asp 
305 310 315 320 

Leu Arg Gin His Arg Gly Leu Trp Cys Asn Gin Asn Gly Ala Gly Leu 
325 330 335 

Gly Gin Pro Pro Gin Ala Ser Pro Thr Asp Phe Pro Asn Ala His Leu 
340 . 345 350 

Asp Ala Tyr val Trp lie Lys Pro Pro Gly Glu ser Asp Gly Thr Ser 
355 360 365 

Ala Ala Ser Asp Pro Thr Thr Gly Lys Lys Ser Asp Pro Met Cys Asp 
370 375 380 

Pro Thr Tyr Thr Thr Ser Tyr Gly val Leu Thr Asn Ala Leu Pro Asn 
385 390 395 400 

Ser Pro lie Ala Gly Gin Trp Phe Pro Ala Gin Phe Asp Gin Leu Val 
405 410 415 

Ala Asn Ala Arg Pro Ala val 
420 

<210> 10 
<211> 430 
<212> PRT 

<213> Cellulomonas fimi 
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<220> 

<221> mi sc_f eature 

<223> CBHA domain of Cellulomonas fimi 
<400> 10 

Ala Pro Val His Val Asp Asn Pro Tyr Ala Gly Ala Val Gin Tyr val 
15 10 15 

Asn Pro Thr Trp Ala Ala Ser val Asn Ala Ala Ala Gly Arg Gin Ser 
20 25 30 

Ala Asp Pro Ala Leu Ala Ala Lys Met Arg Thr val Ala Gly Gin Pro 
35 40 45 

Thr Ala val Trp Met Asp Arg lie Ser Ala lie Thr Gly Asn Ala Asp 
50 55 60 

Gly Asn Gly Leu Lys Phe His Leu Asp Asn Ala Val Ala Gin Gin Lys 
65 , 70 75 80 

Ala Ala Gly val Pro Leu val Phe Asn Leu val lie Tyr Asp Leu Pro 
85 90 95 

Gly Arg Asp Cys Phe Ala Leu Ala Ser Asn Gly Glu Leu Pro Ala Thr 
100 105 110 

Asp Ala Gly Leu Ala Arg Tyr Lys Ser Glu Tyr lie Asp Pro lie Ala 
115 120 125 

Asp Leu Leu Asp Asn Pro Glu Tyr Glu Ser lie Arg lie Ala Ala Thr 
130 135 140 

lie Glu Pro Asp Ser Leu Pro Asn Leu Thr Thr Asn lie Ser Glu Pro 
145 150 155 160 

Ala Cys Gin Gin Ala Ala Pro Tyr Tyr Arg Gin Gly val Lys Tyr Ala 
165 170 175 

Leu Asp Lys Leu His Ala lie Pro Asn Val Tyr Asn Tyr lie Asp lie 
180 185 190 

Gly His Ser Gly Trp Leu Gly Trp Asp Ser Asn Ala Gly Pro Ser Ala 
195 200 205 

Thr Leu Phe Ala Glu Val Ala Lys Ser Thr Thr Ala Gly Phe Ala Ser 
210 215 220 

He Asp Gly Phe Val Ser Asp val Ala Asn Thr Thr Pro Leu Glu Glu 
225 230 235 240 

Pro Leu Leu Ser Asp Ser Ser Leu Thr lie Asn Asn Thr Pro lie Arg 
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250 255 



Ser ser Lys Phe Tyr Glu Trp Asn Phe Asp Phe Asp Glu lie Asp Tyr 
260 265 270 

Thr Ala His Met His Arg Leu Leu Val Ala Ala Gly Phe Pro Ser ser 
275 280 285 

lie Gly Met Leu val Asp Thr ser Arg Asn Gly Trp Gly Gly Pro Asn 
290 295 300 

Arg Pro Thr Ser lie Thr Ala Ser Thr Asp Val Asn Ala Tyr Val Asp 
305 310 . 315 320 

Ala Asn Arg val Asp Arg Arg val His Arg Gly Ala Trp Cys Asn Pro 
325 330 335 

Leu Gly Ala Gly lie Gly Arg Phe Pro Glu Ala Thr Pro Ser Gly Tyr 
340 345 350 

Ala Ala Ser His Leu Asp Ala Phe Val Trp lie Lys Pro Pro Gly Glu 
355 360 365 

Ser Asp Gly Ala Ser Thr Asp lie Pro Asn Asp Gin Gly Lys Arg Phe 
370 375 380 

Asp Arg Met Cys Asp Pro Thr Phe val Ser Pro Lys Leu Asn Asn Gin 
385 390 395 400 

Leu Thr Gly Ala Thr Pro Asn Ala Pro Leu Ala Gly Gin Trp Phe Glu 
405 410 415 

Glu Gin Phe val Thr Leu val Lys Asn Ala Tyr Pro val lie 
420 425 430 

<210> 11 
<211> 432 
<212> PRT 

<213> Thermobifida fusca 
<220> 

<221> mi sc_f eature 

<223> E3 domain of Thermobifida fusca 
<400> 11 

Pro Gly Gly Pro Thr Asn Pro Pro Thr Asn Pro Gly Glu Lys val Asp 
15 10 15 

Asn Pro Phe Glu Gly Ala Lys Leu Tyr Val Asn Pro Val Trp Ser Ala 
20 25 30 



Lys Ala Ala Ala Glu Pro Gly Gly Ser Ala Val Ala Asn Glu Ser. Thr 
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35 40 45 

Ala Val Trp Leu Asp Arg lie Gly Ala lie Glu Gly Asn Asp Ser Pro 
50 55 60 

Thr Thr Gly Ser Met Gly Leu Arq Asp His Leu Glu Glu Ala Val Arg 
65 70 75 80 

Gin Ser Gly Gly Asp Pro Leu Thr lie Gin val val lie Tyr Asn Leu 
85 90 95 

pro Gly Arg Asp Cys Ala Ala Leu Ala Ser Asn Gly Glu Leu Gly Pro 
100 105 110 

Asp Glu Leu Asp Arg Tyr Lys Ser Glu Tyr lie Asp Pro lie Ala Asp 
115 120 125 

lie Met Trp Asp Phe Ala Asp Tyr Glu Asn Leu Arg lie Val Ala lie 
130 135 140 

lie Glu lie Asp Ser Leu Pro Asn Leu val Thr Asn Val Gly Gly Asn 
145 150 155 160 

Gly Gly Thr Glu Leu Cys Ala Tyr Met Lys Gin Asn Gly Gly Tyr val 
165 170 175 

Asn Gly Val Gly Tyr Ala Leu Arg Lys Leu Gly Glu lie Pro Asn Val 
180 185 190 

Tyr Asn Tyr lie Asp Ala Ala His His Gly Trp lie Gly Trp Asp Ser 
195 200 205 

Asn Phe Gly Pro Ser val Asp lie Phe Tyr Glu Ala Ala Asn Ala Ser 
210 215 220 

Gly ser Thr val Asp Tyr val His Gly Phe lie ser Asn Thr Ala Asn 
225 230 235 240 

Tyr Ser Ala Thr Val Glu Pro Tyr Leu Asp Val Asn Gly Thr val Asn 
245 250 255 

Gly Gin Leu lie Arg Gin Ser Lys Trp Val Asp Trp Asn Gin Tyr val 
260 265 270 

Asp Glu Leu Ser Phe Val Gin Asp Leu Arg Gin Ala Leu lie Ala Lys 
275 280 285 

Gly Phe Arg Ser Asp lie Gly Met Leu lie Asp Thr Ser Arg Asn Gly 
290 295 300 

Trp Gly Gly Pro Asn Arg Pro Thr Gly Pro Ser Ser Ser Thr Asp Leu 
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305 310 315 320 

Asn Thr Tyr val Asp Glu Ser Arg lie Asp Arg Arg lie His Pro Gly 
325 330 335 

Asn Trp Cys Asn Gin Ala Gly Ala Gly Leu Gly Glu Arg Pro Thr val 
340 345 350 

Asn Pro Ala Pro Gly val Asp Ala Tyr val Trp val Lys Pro Pro Gly 
355 360 365 

Glu ser Asp Gly Ala Ser Glu Glu lie Pro Asn Asp Glu Gly Lys Gly 
370 375 380 

Phe Asp Arg Met Cys Asp Pro Thr Tyr Gin Gly Asn Ala Arg Asn Gly 
385 390 395 400 

Asn Asn Pro Ser Gly Ala Leu Pro Asn Ala Pro lie ser Gly His Trp 
405 410 , 415 

Phe Ser Ala Gin Phe Arg Glu Leu Leu Ala Asn Ala Tyr Pro Pro Leu 
420 425 430 



<210> 12 
<211> 221 
<212> PRT 

<213> streptomyces sp. 
<220> 

<221> misc_feature 

<223> streptomyces sp. 11AG8 cellulase 12A 
<400> 12 

Asn Gin Gin lie Cys Asp Arg Tyr Gly Thr Thr Thr lie Gin Asp Arg 
15 10 15 

Tyr val Val Gin Asn Asn Arg Trp Gly Thr Ser Ala Thr Gin Cys lie 
20 25 30 

Asn Val Thr Gly Asn Gly Phe Glu lie Thr Gin Ala Asp Gly Ser val 
35 40 45 

pro Thr Asn Gly Ala Pro Lys Ser Tyr Pro Ser val Tyr Asp Gly Cys 
50 55 60 

His Tyr Gly Asn Cys Ala Pro Arg Thr Thr Leu Pro Met Arg lie Ser 
65 70 75 80 

Ser He Gly Ser Ala Pro Ser Ser val Ser Tyr Arg Tyr Thr Gly Asn 
85 90 95 

Gly val Tyr Asn Ala Ala Tyr Asp lie Trp Leu Asp Pro Thr Pro Arg 
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100 105 110 

Thr Asn Gly Val Asn Arg Thr Glu lie Met lie Trp Phe Asn Arg val 
115 120 125 

Gly Pro val Gin Pro lie Gly Ser Pro Val Gly Thr Ala His val Gly 
130 135 140 

Gly Arg Ser Trp Glu val Trp Thr Gly Ser Asn Gly Ser Asn Asp val 
145 150 155 160 

lie Ser Phe Leu Ala Pro Ser Ala lie Ser Ser Trp Ser Phe Asp val 
165 170 175 

Lys Asp Phe Val Asp Gin Ala val Ser His Gly Leu Ala Thr Pro Asp 
180 185 190 

Trp Tyr Leu Thr Ser lie Gin Ala Gly Phe Glu Pro Trp Glu Gly Gly 
195 200 205 

Thr Gly Leu Ala val Asn Ser Phe Ser ser Ala val Asn 
210 215 220 . 



<210> 13 

<211> 221 

<212> PRT 

<213> Streptomyces lividans 
<220> 

<221> misc_feature 

<223> Streptomyces lividans cellulase B 

<400> 13 

Asp Thr Thr lie Cys Glu Pro Phe Gly Thr Thr Thr lie Gin Gly Arg 
1 5 10 15 

Tyr val val Gin Asn Asn Arg Trp Gly Ser Thr Ala Pro Gin Cys Val 
20 25 30 

Thr Ala Thr Asp Thr Gly Phe Arg val Thr Gin Ala Asp Gly Ser Ala 
35 40 45 

Pro Thr Asn Gly Ala Pro Lys Ser Tyr Pro Ser Val Phe Asn Gly Cys 
50 55 60 

His Tyr Thr Asn Cys Ser Pro Gly Thr Asp Leu Pro val Arg Leu Asp 
65 70 75 80 

Thr val Ser Ala Ala Pro ser ser lie ser Tyr Gly Phe Val Asp Gly 
85 90 95 

Ala val Tyr Asn Ala Ser Tyr Asp lie Trp Leu Asp Pro Thr Ala Arg 
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100 105 110 

Thr Asp Gly Val Asn Gin Thr Glu lie Met lie Trp Phe Asn Arg Val 
115 120 125 

Gly Pro lie Gin Pro lie Gly Ser Pro val Gly Thr Ala Ser Val Gly 
130 135 140 

Gly Arg Thr Trp Glu Val Trp Ser Gly Gly Asn Gly Sen Asn Asp Val 
145 150 155 160 

Leu Ser Phe Val Ala Pro Ser Ala lie ser Gly Trp Ser Phe Asp val 
165 170 175 

Met Asp Phe val Arg Ala Thr Val Ala Arg Gly Leu Ala Glu Asn Asp 
180 185 190 

Trp Tyr Leu Thr Ser Val Gin Ala Gly Phe Glu Pro Trp Gin Asn Gly 
195 200 205 

Ala Gly Leu Ala val Asn Ser Phe Ser Ser Thr Val Glu 
210 215 220 



<210> 14 

<211> 228 

<212> PRT 

<213> Acidothermus cellulolyticus 
<220> 

<221> misc_feature 

<223> Acidothermus cellulolyticus GH12 domain 

<400> 14 

Cys Thr Pro Gly Pro Asn Gin Asn Gly Val Thr Ser Val Gin Gly Asp 
1 5 10 15 

Glu Tyr Arg val Gin Thr Asn Glu Trp Asn Ser Ser Ala Gin Gin Cys 
20 25 30 

Leu Thr lie Asn Thr Ala Thr Gly Ala Trp Thr val Ser Thr Ala Asn 
35 40 45 

Phe ser Gly Gly Thr Gly Gly Ala Pro Ala Thr Tyr Pro Ser lie Tyr 
50 55 60 

Lys Gly Cys His Trp Gly Asn Cys Thr Thr Lys Asn val Gly Met Pro 
65 70 75 80 

lie Gin lie ser Gin lie Gly Ser Ala val Thr Ser Trp Ser Thr Thr 
85 90 95 

Gin val Ser Ser Gly Ala Tyr Asp val Ala Tyr Asp lie Trp Thr Asn 
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110 



100 



105 



Ser Thr Pro Thr Thr Thr Gly 
115 



Gin 
120 



Pro Asn 



Gly Thr Glu lie Met lie 
125 



Trp Leu Asn Ser Arg Gly Gly val Gin Pro Phe Gly Ser Gin Thr Ala 
130 135 140 



Thr Gly Val Thr Val Ala Gly His Thr Trp Asn Val Trp Gin Gly Gin 
145 150 155 160 



Gin Thr Ser Trp Lys lie lie Ser Tyr Val Leu Thr Pro Gly Ala Thr 
165 170 175 



Ser lie Ser Asn Leu Asp Leu Lys Ala lie Phe Ala Asp Ala Ala Ala 
180 185 190 



Arg Gly Ser Leu Asn Thr Ser Asp Tyr Leu Leu Asp val Glu Ala Gly 
195 200 205 



Phe Glu lie Trp Gin Gly Gly Gin Gly Leu Gly Ser Asn Ser Phe Ser 
210 215 220 



val ser Val Thr 
225 
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